Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition

نویسندگان

Kei Hashimoto

Heiga Zen

Yoshihiko Nankaku

Akinobu Lee

Keiichi Tokuda

چکیده

Decision tree based context clustering [Young; '94] ・ Construct a parameter tying structure ・ Can estimate robust parameter ・ Can generate unseen context dependent models ・ Minimum description length (MDL) criterion [Shinoda; '97] Bayesian approach ・ Variational Bayesian (VB) method [Attias; '99] ⇒ Applied to speech recognition [Watanabe; '04] ・ Can use prior information ⇒ Affect context clustering Problems Prior information is not generally given ⇒ Prior distribution becomes tuning parameters

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...

متن کامل

Continuous Speech Recognition Based on General Factor Dependent Acoustic Models

This paper describes continuous speech recognition incorporating the additional complement information, e.g., voice characteristics, speaking styles, linguistic information and noise environment, into HMM-based acoustic modeling. In speech recognition systems, contextdependent HMMs, i.e., triphone, and the tree-based context clustering have commonly been used. Several attempts to utilize not on...

متن کامل

Using Hybrid HMM/BN Acoustic Models: Design and Implementation Issues

In recent years, the number of studies investigating new directions in speech modeling that goes beyond the conventional HMM has increased considerably. One promising approach is to use Bayesian Networks (BN) as speech models. Full recognition systems based on Dynamic BN as well as acoustic models using BN have been proposed lately. Our group at ATR has been developing a hybrid HMM/BN model, wh...

متن کامل

Speaker adaptation in noisy environments based on parameter estimation using uncertain data

This paper describes new method for the speaker adaptation of HMM parameters in environments with background noise. This method is based on Bayesian estimation, and calculates the a posteriori distribution of cleanspeech HMM parameters from their a priori distribution by using noisy speech observations. The advantage of the method is that the distribution of the noise can be taken into account ...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition

نویسندگان

چکیده

منابع مشابه

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Continuous Speech Recognition Based on General Factor Dependent Acoustic Models

Using Hybrid HMM/BN Acoustic Models: Design and Implementation Issues

Speaker adaptation in noisy environments based on parameter estimation using uncertain data

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

عنوان ژورنال:

اشتراک گذاری